Biomedical Text Mining: State-of-the-Art, Open Problems and Future Challenges

Authors

  • Andreas Holzinger
  • Johannes Schantl
  • Miriam Schroettner
  • Christin Seifert
  • Karin M. Verspoor
Abstract

Text is a very important type of data within the biomedical domain. For example, patient records contain large amounts of text entered in a non-standardized format, which poses many challenges for processing such data. For the clinical doctor, the written text in the medical findings, rather than images or multimedia data, is still the basis for decision making. However, the steadily increasing volumes of unstructured information call for machine learning approaches to data mining, i.e. text mining. This paper provides a concise overview of selected text mining methods, focusing on statistical methods, i.e. Latent Semantic Analysis, Probabilistic Latent Semantic Analysis, Latent Dirichlet Allocation, Hierarchical Latent Dirichlet Allocation, Principal Component Analysis, and Support Vector Machines, along with some examples from the biomedical domain. Finally, we present some open problems and future challenges, particularly from the clinical domain, that we expect to stimulate future research.

Similar resources

Frontiers of biomedical text mining: current progress

It is now almost 15 years since the publication of the first paper on text mining in the genomics domain, and decades since the first paper on text mining in the medical domain. Enormous progress has been made in the areas of information retrieval, evaluation methodologies and resource construction. Some problems, such as abbreviation-handling, can essentially be considered solved problems, and...

Sample Chapter LNCS 8401: Interactive Knowledge Discovery and Data Mining: State-of-the-Art and Future Challenges in Biomedical Informatics

Among the grand challenges in our networked world are the large, complex, multi-dimensional and weakly structured data sets (big data) and the masses of unstructured information. These increasingly enormous amounts of data require new, efficient and user-friendly solutions for knowledge discovery. This challenge is most evident in the field of Biomedicine: the trend towards personalized medici...

Text Mining in Bioinformatics: Research and Application

The biomedical literature has been growing at an exponential rate. Finding useful and relevant information in such a huge data set is a daunting task for users. Text mining is a powerful tool for solving this problem. In this paper, we survey text mining in bioinformatics, with emphasis on its applications. In this paper, the main research directions of text ...

BioTxtM 2016 Fifth Workshop on Building and Evaluating Resources for Biomedical Text Mining

Methods based on deep learning approaches have recently achieved state-of-the-art performance in a range of machine learning tasks and are increasingly applied to natural language processing (NLP). Despite strong results in various established NLP tasks involving general domain texts, there is only limited work applying these models to biomedical NLP. In this paper, we consider a Convolutional ...

Closed- and Open-loop Deep Brain Stimulation: Methods, Challenges, Current and Future Aspects

Deep brain stimulation (DBS) is known as the most effective technique in the treatment of neurodegenerative diseases, especially Parkinson disease (PD) and epilepsy. Relative healing and effective control of disease symptoms are the most significant reasons for the tangible tendency in use and development of this technology. Nevertheless, more cellular and molecular investigations are required ...

Publication date: 2014